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Abstract 

Stochastic orders are binary relations defined on probability distributions which capture intuitive no- 
tions like being larger or being more variable. This paper introduces stochastic ordering of instantaneous 
SNRs of fading channels as a tool to compare the performance of communication systems over different 
channels. Stochastic orders unify existing performance metrics such as ergodic capacity, and metrics 
based on error rate functions for commonly used modulation schemes through their relation with convex, 
and completely monotonic (cm.) functions. Toward this goal, performance metrics such as instantaneous 
error rates of M-QAM and M-PSK modulations are shown to be cm. functions of the instantaneous SNR, 
while metrics such as the instantaneous capacity are seen to have a completely monotonic derivative 
(c.m.d.). It is shown that the commonly used parametric fading distributions for modeling line of sight 
(LoS), exhibit a monotonicity in the LoS parameter with respect to the stochastic Laplace transform 
order. Using stochastic orders, average performance of systems involving multiple random variables 
are compared over different channels, even when closed form expressions for such averages are not 
tractable. These include diversity combining schemes, relay networks, and signal detection over fading 
channels with non-Gaussian additive noise, which are investigated herein. Simulations are also provided 
to corroborate our results. 

I. Introduction 

Given the vast number of wireless systems with different purposes operating over fading 
channels, it is of interest to know how to decide whether one communication channel is superior 
to another. The performance of such systems are quantified by averaging a metric (e.g. bit 
or symbol error rates, or channel capacity) over the distribution of the random channel. Very 
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often, when one channel is better than another in terms of a particular metric, it is also better 
with respect to another metric. However, this is not always true. Traditionally, answering this 
question has relied on single parameter comparisons between channels using characteristics 
such as diversity order, "amount of fading", Ricean factor and others [1]. These are parametric 
approaches that quantify how much fading the channel exhibits, but do not provide a unified 
framework to compare channels across many different performance metrics. In this work, we 
propose to use stochastic orders to address this issue. 

The theory of stochastic orders (or dominance) provides a comprehensive framework to 
compare two random variables (RVs) or vectors [2]. The simplest and most widely used stochastic 
order compares the cumulative distribution functions (CDF) of two RVs, which defines a partial 
order between pairs of RVs. When the RVs represent instantaneous SNRs in a fading envi- 
ronment, this corresponds to comparing the outage probabilities in a wireless communication 
context. There are many other stochastic orders that capture comparisons of RVs in terms of 
size, and variability. Different than the related majorization theory [3], [4], which defines a 
partial order on deterministic vectors, stochastic orders apply to random variables. Stochastic 
ordering has become an indispensable tool in many increasingly diverse areas of probability and 
applied statistics over the past sixty years. Examples of such areas include reliability theory [5], 
actuarial sciences [6], risk analysis [7], economics [8], and stochastic processes [9]. However, 
the applications of this set of tools in physical layer wireless communications are surprisingly 
very few, although it has found numerous applications in communication networks (please see 
[9], [2, Ch. 13-14] and references therein). 

We now review the limited literature on the applications of stochastic orders in physical 
layer communications. Bounds on the per cell sum rate under arbitrary fading in the high SNR 
regime have been obtained using the aforementioned outage-based "usual stochastic order" in 
[10]. Stochastic ordering has also been applied to obtain bounds on the outage probability in 
Bluetooth piconets under Ricean fading in [11]. In [12], the usual stochastic order is used 
to bound monotone performance metrics in Ricean fading environments with beam selection. 
Reference [13] shows that stochastic ordering of the SNR between the sender and any two 
receivers is sufficient for the existence of a degraded channel in a layered erasure broadcast 
channel modeled using the binary expansion model. 

To the best of our knowledge, there is no systematic exploitation of the general stochastic 
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ordering theory which can be used to provide a means for comparing wireless systems. All the 
above references use the usual stochastic order, which can be interpreted as a comparison of 
the outage probabilities, and do not exploit the full gamut of stochastic orders available [2]. 
In this paper, we give a wide range of examples illustrating how different stochastic orders 
are appropriate for comparing systems using different metrics with analytical properties such 
as monotonicity, convexity, and complete monotonicity, which shed light into the connections 
between performance metrics such as error rates and ergodic capacity. Additionally, we find the 
conditions for the preservation of inequalities satisfied by the averages of performance metrics 
of individual systems, when multiple such systems are combined. These may be combinations 
in parallel, in series, or otherwise, as may be seen in relay networks. Such a study permits the 
comparison of performance of systems, even in settings where closed-form expressions are not 
tractable. 

II. Stochastic Ordering Preliminaries 

The literature on stochastic ordering, primarily in reliability theory and statistics, delineates 

numerous stochastic orders, many of which fall under the subclass of integral stochastic orders. 
We begin with a short description of the theory of integral stochastic orders, which can be found 
in [2], [6]. 

Let denote a class of real valued functions g : R"*" — > R, and X and Y be RVs with CDFs 
Fx {•) and Fy {•) respectively. We define the integral stochastic order with respect to as [6]: 

X<^Y^E [g{X)] < E [g{Y)] , Wg e <3. (1) 

In this case, 6 is known as a generator of the order <©. A stochastic order can have more than 
one generator. For a given stochastic order, it is of interest to identify "large" generators which 
are useful in identifying the equivalence of two orders. The largest generator set of functions for 
a stochastic order contains all other generators, and is termed the maximal generator [6]. It is also 
of interest to find "small" generators which specify necessary conditions for the ordering of two 
RVs. We now give three examples of integral stochastic orders by specifying the corresponding 
generator set of functions 0. 
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A. Usual Stochastic Order 

The usual stochastic order compares the magnitude of two RVs. In this case a small generator 
is the set of all non-decreasing indicator functions: = {g{x) : g{x) = I[x > p],p e M}. 
From (1) it follows that this order is equivalent to comparing the CDFs of the RVs. Formally, 
we write 



To see the interpretation of this in the context of wireless channels, consider two channels to be 
compared, with effective channels X :— and Y :— where and correspond 

to the complex channel gains of two wireless systems. The usual stochastic ordering of X and 
Y is equivalent to comparing their corresponding outage probabilities for all outage thresholds. 
The maximal generator for the usual stochastic order is the set of all increasing functions [6]. 
As a result, with the choice g{x) = a; in (1), we obtain E [X] < E [Y] whenever X <Lt Y, 
which agrees with the intuition that a larger random variable must have a larger mean value. 

B. Convex Order 

In this case is the set of all convex functions, and the order is denoted as X <cx Y. Since 
g{x) = X and g(x) = —x are both convex, from (1), we have E [X] = E [Y] whenever X and Y 
are convex ordered. Therefore, convex ordering establishes that the RVs have the same mean and 
X is "less variable" than Y. Clearly, in the fading context, this can be used to identify channels 
with "less fading". Since <cx is a measure of variability, one would expect that a degenerate 
RV is less in the convex sense than any other RV with the same mean. Indeed, this is the case: 
If Fxix) ^ I[x> ij], then X <ex Y for all RVs Y with E [Y] = So the degenerate RV has 
an absolute minimum dispersion, as measured by the convex order, which is a consequence of 
Jensen's inequality. 

Many performance metrics, such as channel capacity, error rates for different modulations [14] 
and coding schemes in wireless systems are convex or concave functions of the instantaneous 
SNR. Therefore, establishing convex ordering of two RVs can help us qualitatively measure 
the relative average performance of the corresponding systems. Note that if instead of convex 
functions, the class & is chosen as the set of all concave functions, one would get the same 
order with a reversal in the inequality. 




(2) 
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Verifying the usual stochastic ordering of two RVs is straightforward through (2). What follows 
are easily testable sufficient conditions for X <cx Y- Let S~ {g{x)) denote the number of sign 

changes of g{x) as x increases over [0, oo), then X <cx F if E [X] = E [Y] and any of the 
following hold [2]: 

S~ {fy (x) — fx (x)) = 2 and the sign sequence is +, — , +. (3) 
S~ {Fy (x) — Fx (x)) = 1 and the sign sequence is +, — , (4) 

where /x (■) and fy {■) are the probability density functions (PDFs) of X and Y respectively. 

Interestingly, to the best of our knowledge, although convex ordering of RVs is widely used 
in many other areas, it has never been used in physical-layer wireless communications. 

C. Laplace Transform Order 

Similar to <st and <cx, it is possible to order random variables based on their Laplace 
transforms (LT). In this case, (5 = {g{x) : g{x) = — exp {—px) , p > 0}, so that 

^ <Lt ^ ^ E [exp(-yp)] < E [exp(-Xp)] , V p > . (5) 

Interpreting exp(— px) as being proportional to the instantaneous error rate (as in the case for 
differential-PSK (DPSK) modulation and Chemoff bounds for other modulations), LT ordering 
of the instantaneous SNRs in (5) can be interpreted as an inequality in the average error rates 
satisfied at all values of SNR p. One of the powerful consequences of LT ordering is that 

X<uY^E [g{X)] > E [g{Y)] , (6) 

for all completely monotonic (cm.) functions g{-) [2, pp. 96]. A similar result with a reversal 
in the inequality states that 

X<^,Y^W. [g{X)] < E [g{Y)] , (7) 

for all functions g{-) that have a completely monotonic derivative (c.m.d) function. Recall 
that the derivatives of a cm. function alternate in sign. Also, they can be written as a pos- 
itive mixture of decaying exponentials. More formally, a cm. function by definition satis- 
fies (—1)" (Pg{x)/dx'^ > 0, for X > and n — 0,1,2,..., which by the celebrated Bern- 
stein's theorem is equivalent to the existence of a positive function //(•) such that g{x) — 
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oo 

J exp {—xu) fj,{u) du [2, pp. 96]. It can be easily verified that cm. (c.m.d) functions are convex 



(concave) and decreasing (increasing). Furtlier, if gi{x) is completely monotonic, and g2{x) > 
has a completely monotonic derivative, then the composition gi {g2{x)) is completely monotonic. 

It is useful to mention that for any two RVs X and Y, X <cx Y ^ Y <Lt X, which follows 
from the fact that — exp(— px) is concave in x for any p > 0. Hence, convex ordering provides 
a method to generate or verify LT ordering between two RVs. Indeed either of the conditions 
(3) or (4) together with equal mean values for X and Y imply that X <u Y. Further, observe 
that X <st Y ^ X <Lt Y, which follows since — cxp(— px) is increasing in x for p > 0. 

In the rest of the paper, we illustrate the power of the stochastic ordering framework in 
comparing wireless channels and systems. We will investigate the convexity and complete 
monotonicity properties related to the error rate and capacity expressions in Section EI, which 
will facilitate comparing the average performances of systems by using (6) and (7). In Section 
rv we identify commonly used channel distributions which are LT or convex ordered. Section V 
investigates the conditions under which these stochastic orders are preserved in complex systems 
where the performance of their constituent parts satisfy an order. Finally, relevant simulations 
to supplement the theoretical results are provided in Section VI. 

III. Ordering of Average Error Rate and Ergodic Capacity Metrics 
A. Symbol Error Rate 

It has been established in [15] that the error rate of binary signaling in the presence of noise 
with a uni-modal and differentiable PDF is a convex function of the SNR when maximum 
likelihood decoding is performed. Also, it is known that the instantaneous error probabilities 
of all one-dimensional and two-dimensional constellations with ML decoding in the presence 
additive white Gaussian noise (AWGN) is a convex function of the SNR [14]. In this section, we 
go one step further and establish the complete monotonicity of some two-dimensional modulation 
schemes, which will be useful in establishing inequalities between averaged performance metrics. 
It is well known that Q (y/x) is cm. [16], from which the complete monotonicity of the 
instantaneous error rate of the form (px) = aQ [\/hpx) easily follows, for a, 6 > 0. Here 
a and b are modulation dependent constants which can be chosen to get exact performance in 
some cases (e.g. a = 1, 6 = 2 for BPSK), or approximations in others (a = 3/4,6 = 4/5 for 
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16-QAM). For the exact case, it follows from (6) that E [P^ (pY)] < E [P^ (pX)], for p > 
whenever X <Lt Y- 

We now establish, for the first time, the complete monotonicity of exact symbol error rates 
of square M-QAM and Af-PSK modulations which are not in the form P^ (px) = aQ (y^Spx), 
and offer sharper results than those mentioned above, since they do not rely on approximations. 

The M-PSK symbol error rate is given by the following [1, pp.195]: 

(M-l)7r/M 

PP^'ip^)-- f e.p(-px^)de, (8) 





.2/ 



where ckpsk sin (tt/M). After a change of variables, (8) can be expressed as the Laplace 
transform of a positive function: 



oo 



I 



n, > "PSK 

^ - sin2((M-l)^) 



27r J Uy/U - CKpsK 



)PSK 



du , (9) 



which together with Bernstein's Theorem suggests that P^ (px) is cm.. 
Consider now the square M-QAM error rate function [1, pp.195]: 

P^^"^ (px) = aQ (V"QAMpa;) - bQ^ (v«QAMpa;) , (10) 



where q;qam := 3/(M - 1), a := 4(VM - 1)/VM and b := a'^/A. Note that < 6 < a. We 
claim that (10) is cm. for any a, b such that 6 < a. To see this, recall 

7r/2fc 

Q'{V^)=- f exp(-— ^) de, (11) 
TT 7 \ 2 sm / 



for A; = 1, 2 [1]. After a change of variables similar to (8), we obtain 



oo 



2n J 



al [0.5 <u <1] ^{a-b)I [u > 1] 
u\j2u — 1 u\j2u — 1 



du , (12) 





which is also cm. by Bernstein's theorem, since & < a. In conclusion, whenever X <Lt Y , 
E \P^ {pY)\ < E [Pg (pX)] for all average SNR p, where P^ (•) could be given by either (8) or 
(10). This follows from the definition of the LT order and the cm. properties of instantaneous 
error rates of M-QAM or M-PSK modulations. 
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B. Ergodic Capacity 

We now show that stochastic ordering of instantaneous effective channel distributions implies 
that their ergodic channel capacities satisfy a corresponding inequality at all average SNRs. We 
begin with the case where only the receiver has channel status information (CSI). 

1 ) Ergodic Capacity with Receive CSI only: The instantaneous capacity, conditioned on the 
effective channel X — x when only the receiver has CSI is given by C{px) — log(l + px), 
where p is the average SNR. Since dC{px)/dx = p/(l + px) is cm. in x, from (7) X <Lt Y 
implies that the ergodic capacities satisfy Ex \C {pX)] < Ey [C {pY)] for p > 0. Recall that by 
(5), LT ordering of the channels X and Y can be interpreted as a comparison of the average 
error rates, when the instantaneous error rate is a decaying exponential. As a result, one can 
loosely say that if the average error rates of two channels X and Y satisfy the inequality (5) 
at all SNRs, then so do the ergodic capacities. Surprisingly, however, the converse is not true, 
as we now illustrate. Consider a Pareto-type distribution, which is appropriate for modeling the 
instantaneous SINK in the presence of interference [17] : 

Fx(z)^j^^^^,z>0, ^>0. (13) 

Using integration by parts and simplifying, we obtain 



Ex[C(pX)]^/ ,^^^^-^^^,^ d.. (14) 







Taking the derivative with respect to /3, it is seen that Ex [C (pX)] is a decreasing function of 
f3, for p > 0. This shows that for f3^ < (3^, Ex [C {pX)] > Ey [C {pY)] for p > 0. On the other 
hand, since Fx (z) — z^ + o{z^) near z — 0, the average symbol error rate for an exponential 
instantaneous error rate function satisfies E [exp(— pX)] = {Gcp)~^ + o(p~'^), where Gc is 
the array gain and ^ is the diversity order [18]. Hence, as ^ increases, the high-SNR average 
error rate decreases, while the capacity also decreases at all SNR p! Interpreting the ergodic 
capacity as what is achievable by coding over an i.i.d. time-extension of the channel, we reach 
the conclusion that even though Y offers more diversity than X for an uncoded system, the i.i.d. 
extension of X lends itself to more diversity than that of F. To put it more simply, at high SNR, 
it is possible for one channel to be superior to another in terms of error rates in the absence of 
coding, while being inferior when the capacity achieving code is employed over both channels. 
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2) Channel Inversion and Delay-Limited Capacity: When CSI is available at the transmitter, 
it can be used for power adaptation. A simple, suboptimal approach is to "invert" the channel 
at the transmitter, so that effectively the receiver sees a non-fading AWGN channel. Such an 
approach is viable only when E [X^^] < oo, leading to a finite average transmit power. This is 
the case whenever the channel offers a diversity order strictly greater than one. Channel inversion 
has the advantage that a channel code designed for the AWGN channel can be used effectively, 
and that the code length need not depend on the channel coherence time to average out the 
fading. This "delay-limited" approach [19] gives rise to an achievable rate given by 

cUp) = log (l + . (15) 



E [X-i] 

Clearly, since g{x) = x^^ is a cm. function of x, E [X~^] > E [V^^], whenever X <Lt Y. This 
implies Ccj(p) < Cqi{p) for all p, since Ccj(p) in (15) is a decreasing function of E 

3) Optimal Power and Rate Adaptation (OA): It is well known that CI is not optimal, since 
when the channel gain becomes arbitrarily small, the transmitter uses extremely high power. To 
overcome this limitation, the optimal power and rate adaptation scheme is proposed in [19], 
where water-filling across time is performed subject to an average transmit power constraint. 
The capacity so obtained over a channel with instantaneous SNR X is given by [19] : 

oo 

Coa{p)= J log(^) d[^-^x{z)] , (16) 

zt{p) 

where zt{p) is the signaling threshold, which is implicitly governed by the power constraint as 
follows: 

CO 

^ dFx{z)=p. (17) 



zt{p) z 



^ y 

It can be shown that X <Lt Y does not guarantee Cq^{p) < Coa(p) for P- However, in 
what follows, we will show that under the stronger assumption X <st Y, Coa(p) ^ ^opXp) for 
all p. 

Using integration by parts on (17), it is observed that for X <st Y, we have z^{p) > zj {p). 
Now, integrating (16) by parts, under the assumptions that lim (1 — Fx {z)) \og{z/zf{p)) = 

z— >oo 
X Y 

and lim(l — Fy (z)) \og{z / zJ (p)) = 0, it is seen that Coa(p) ^ C'oa(p) P > 0, since 

z—>-oo 

zfip) > zj{p). Therefore, X <,,Y =^ CqM < cIm, for p > 0. 
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IV. Ordering of Parametric Fading Distributions 

We now proceed to show that commonly used parametric fading distributions are completely 
monotonic in the line of sight parameter with respect to LT and convex orders. 

A. Nakagami Fading 

Consider Nakagami fading model, where the envelope \/X is Nakagami and the effective 
channel X is Gamma distributed [20], with PDF given by 

fx {x) = ^f^x"^-^ exp(-mx) , x > . (18) 
i (m) 

Since E [exp(— pX)] = (1 + p/niy^ is a decreasing function of m for each p, it follows that 
if the m parameters of two channel distributions satisfy < mX, then X <u Y, where X 
and Y have normalized Gamma distributions with parameters m-^ and respectively. This 
shows that for example, all the performance metrics in (8) or (10) that are cm. have averages 
over fading distributions that satisfy the inequality Ey [P^ (pY)] < Ex [P^ (pX)] for all values 
of average SNR p. A similar claim with a reversal in the inequality can be made for the ergodic 
capacity metric. Note that the PDFs of X and Y in (18) are defined to satisfy E [X] = E [Y] = 1, 
independent of the fading parameter m. Hence, the improvements in error rate or ergodic capacity 
at all values of p with increased m is not due to an improvement in average SNR. A stronger 
convex ordering result can also be established. Since E [X] — E [y], < =^ Y <cx X 
can be shown by using (3). We can summarize the results herein by using the terminology that 
the normalized Gamma distribution is monotonically increasing in m with respect to the orders 
<Lt and <cx. 

B. Ricean Fading 

As in the Nakagami case, the Rice distribution will also be shown to be monotonic in the 
LoS parameter K with respect to the orders <Lt and <cx- The instantaneous SNR distribution 

is given by 

/x (x) = (1 + X) exp(-X) exp [-(X + 7o (2a/X(xT1>) , (19) 

where /o(-) is the modified Bessel function of the first kind of order zero. Clearly, E [X] = 1 
is independent of K. Taking the Laplace transform of (19), we have E [exp{—Xp)] = (1 + 
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K) / (1 -\- K -\- p) exp [Kpl {1 -\- K -\- p)], which decreases with K for each p. This impUes that, 
similar to the Nakagami case, increasing K without increasing the average SNR improves the 
average symbol error rate, ergodic capacity, or any average metric obtained from a cm. or c.m.d. 
function. Thus, if < are the Ricean parameters of two channels, then X <Lt Y ■ Similar 
to the Nakagami case, equation (3) can be used to establish a stronger claim that Y <cx X. 

In this specific Ricean context, similar results for the ergodic capacity are found in [21] and 
the references therein, in a more general MIMO setting. However, in these results, either the 
channel power increases with an increase in the LoS component, or only an asymptotically large 
number of antennas is considered. 

V. Communication Systems Involving Multiple RVs 

In the following discussion, we will consider systems involving multiple independent random 
channel coefficients and compare their performance in two different sets of channels, where the 
effective channels associated with the first system are denoted by X := [Xi, . . . ,Xm\ while 
those of the second channel by Y := [Fi, . . . , Ym\. Toward this goal, we use the following result 
[2, pp. 97], which shows that LT ordering is preserved by multivariate functions that are c.m.d.: 

Theorem 1. Let Xi, . . . , Xm be independent and li, . . . , Ym also be independent. If X^ <Lt 
Ym for m — 1, . . . , M, then g {Xi, . . . , Xm) <Lt 9 {Yi, ■ ■ ■ , Ym) for all functions g : — >■ 
such that for m — 1, . . . ,M, (d/dxm) g {xi, . . . , xm) is cm. in Xm, when all other variables 
are fixed. 

We now investigate the systems for which the combined instantaneous SNR is given by a 
function gr(x) := g{xi, . . . , xm), which satisfies the conditions of Theorem 1. Unless otherwise 
mentioned, we will assume throughout that X^ <Lt for m = 1, . . . , M. 

A. Maximum Ratio Combining 

Consider a SIMO diversity combining system with M receive antennas which have complete 
CSI. If maximum ratio combining (MRC) is performed, conditioned on the effective channels 
Xm = for m = 1, . . . , M, the instantaneous SNR at the output of the combiner is proportional 
to 

M 

Smkc (x) = ^Xra, (20) 
m=l 
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which satisfies the conditions of Theorem 1 as easily seen by taking derivatives. Thus, we infer 
that when MRC is performed, (X) <Lt S'mrc 0^)- Consequently, whenever (px) is cm. 

and C{px) is c.m.d., the average error rates satisfy Ey [P^ {P9mrc C^))] ^ [Pc (Pfi'MRc (^))] 
for all p, and the ergodic capacities satisfy Ey [C{pg^j^c (Y))] > lEx [C(P5'mrc (X))]' ^r all p. 

B. Equal Gain Combining 

Next, assume that the SIMO diversity system adopts equal gain combining (EGC) at the 
receiver. In this case, conditioned on the instantaneous effective channels — Xm for m — 
1, . . . , M, the instantaneous SNR at the combiner is proportional to 



The derivative (d/dxi) g^^^^ {xi, . . . ,xm) = M M ^ ^/x^ ) / ^/x^ is a cm. function of Xi, for 



i — 1, . . . , M. Therefore, equal gain combining of a better set of branches results in a better 
system overall, as also expressed more rigorously after (20) in the MRC example. 

C. Selection Combining 

In contrast to the previous two examples, this example shows that even though the individual 
branch instantaneous SNRs are LT ordered, the combined SNR at the output of the combiner need 
not be LT ordered. For selection combining (SC), conditioning on the instantaneous effective 
channels — Xm ior m — 1, . . . , M , we have 



which is not differentiable, and hence is not cm.. In fact, Xm <Lt Ym, "7, = 1, . . . , M does 
not imply max^-^m <Lt maxmV'm- We provide a simple counterexample in Section VI. This 
shows that even though channels Ym provide better average error rates at all p than X^, for 
m = 1, . . . , M for a SISO system, the composite SC channel does not. 

D. Multi-hop Amplify and Forward (AF) 

Consider a multi-hop system with M links subject to AWGN, where Xm is the effective 
channel gain over the m*'' link. It is assumed that the m*^ node has channel information of 




(21) 




(x) = maxx. 



(22) 



m 
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the (m — 1)*'* hop, for m = 2, . . . , M, and the ampUfication factor for each node is the same. 
Conditioned on the instantaneous effective channels Xj^ = for m = 1, . . . , M, the SNR at 
the destination in this case is proportional to [22]: 

-1 

(23) 

Taking the partial derivatives of S'mh-af (^) with respect to each x^j^ for m = 1, . . . , M, it is seen 
that fifMH AF (^) satisfies the conditions of Theorem 1. Thus, ^fMH af ^Lt 5'mh af 0^)- 
a result, the average error rates for the multi-hop AF system satisfy Ey [Pc (p5'mh-\f C^))] — 
Ex [Pq (pA'mh-af (-^))]' p > 0. Importantly, a closed-form expression for the average per- 
formance of this system is not tractable for most practical channel distributions. Despite this, it 
is still possible to compare the error rates of two otherwise identical systems systems with two 
sets of LT ordered effective channels at all average SNRs. 



m=l 



E. Multi-hop Channels with Decode and Forward 

Consider an M-hop channel, where each terminal decodes a received symbol into a binary 
alphabet and forwards it over to the next terminal. Let the instantaneous error rate over the 



i*'* link be given by P ipxA , i — 1,. . M, where we assume < P (x) < 1/2 is cm.. 

i i 

For convenience, we define Xi.^ := and let P^^ (pXi.^) be the combined 

instantaneous error rate of the first 1 < m < M hops. We have the following: 

Theorem 2. Let Xi, . . . , Xm be independent, and likewise for Yi, . . . , Ym- Suppose X^ <Lt Ym 
/or m = 1, . . . , M. Then Ex,^ \P. (pXi:^)l > Ey,^ \p. (pY,..^)] , m = 1, . . . , M. 

|_ l:m J |_ l:m J 

Proof: See Appendix. ■ 
Note that Theorem 2 and its proof carry over when each hop adopts M-ary modulation as 
well, provided that p is large enough to ensure < Exi^^ P^^ (pXi:^) < 1/2. 

F. Post Detection Combining 

Consider an M-antenna post-detection combining (PDC) scheme, where the instantaneous 
symbol error rate on the m^^ branch is P^ {pxm) and is cm. as in the previous example. The 
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instantaneous probability of error of the PDC system is given by^ [23] : 

^e,^ (pxi:m) = e e (n iP^d] I n (i - ^s- 



(24) 



where Sk is a set running over all subsets of {1, . . . , M} with k elements. Taking expectation 
with respect to ^i-.m, which is assumed to have independent components, we have. 



M 



k=M±i Sk \ieSk 




(25) 

Clearly, the average error rate is an increasing function of any of the Ex^ [-Pg^ (pX^)], since it 
is not possible to get improved performance by increasing the average error rate on any particular 
link. This shows that when X^^ <Lt Y^, form— 1, .... M, and P„ (px) is cm., so that the 
average error rates of PDC satisfy Ex [P^^ (pX^)] > Ey [P^ (p^m)] for p > 0, it follows that 



E, 



for p > 0. 



G. Generalized Multi-branch Multi-hop AF Cooperative Relay Networks 

We now consider the generalized relay structure illustrated Fig. 1, which consists of M 
independent branches, each involving Nm relays, for m — 1,...,M, which assist the direct 
link between the source S and the destination D by performing amplify and forward (AF). It is 
assumed that all the links are impaired by AWGN with fixed variance. This model requires the 
branches to communicate through mutually orthogonal channels, so that M independent copies 
are available at the destination which performs MRC (using combining coefficients given in 
[24]). Although approximate expressions for the error rate have been obtained for the case of 
Ricean fading in [24], closed-form expressions are intractable. 

Note that the two-hop fixed AF relay, which finds frequent application in cooperative diversity 
literature [24] and illustrated in Fig. 2 is a special case of this general relay, with M = 1 and 
Nrn = 1. Thus, the forthcoming results obtained for the general case apply for the two-hop relay 
as well. 

We now show that the exact average symbol error rate can be compared over a number of 
fading distributions where the pairs of effective channels are LT ordered. To this end, we show 



'We assume M is odd. Extensions to even M are straight-forward by adding a tie breaker term to (24). 
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that the output SNR of the MRC combiner at the destination satisfies the conditions of Theorem 
1. Let denote the effective channel on the direct link, and Xm,n the effective channel at the 
n*'* hop on link m. Since the destination performs MRC, the instantaneous output SNR is the 
sum of individual end-to-end branch SNRs, each of which are given by (23). Thus, conditioned 
on Xm,n = Xm,n for m = 0, . . . , M and n = 0,...,N^, and defining ^^b-mh-af (X) — 
A'mb-mh-af ^-^0,0 ' -^1,0 ) ■ ■ ■ ) -^i.jvi ) ■ ■ ■ ) Xm,nm ^ ' have 

n 1 



MB-MH-AF 



(X) 



M 

E 



n=l 



1 



1 



+ 2^0,0 ■ 



(26) 



As shown in the arguments following (23), the summand in the RHS of (26) has a cm. deriva- 
tive in each variable. Combining this with Theorem 1, we have Ey [P^ (pA'mb-mh-af (Y))] < 

Ex [Pe {P9 MB-MH-AF 

(X))] for p > 0. 



H. Combined Multipath Fading and Shadowing 

It is well known that the effect of shadow fading on the instantaneous SNR distribution can be 
modeled as a product of a shadowing random variable with a multipath fading random variable 
[1]. Let Xi <Lt Yi be the two multipath fading SNR distributions, and X2 <Lt ^2 be the 
two shadowing distributions. Then, from Theorem 1, it follows that the composite RV satisfies 
X1X2 <Lt ^1^2, since g{xi, X2) = xiX2 has a cm. derivative in each variable. We conclude that 
IExi.X2 [Pq (P-^1-^2)] > lEy^ [Pp {PY1Y2)] , Vp, whenever P^ (■) is cm.. Such conclusions can 
be drawn even in cases where the composite distribution of X1X2 or Y1Y2 cannot be written in 
closed-form. 

/. Systems with non-Gaussian Channel Noise 

In this discussion, we assume the following system model: 

Z = ^/pXS + W , (27) 

where for simplicity, 5" e {— 1, 1}, pX is the instantaneous SNR, p the average SNR, and W is 
non-Gaussian noise. 
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1) Gaussian Mixture: In this model, W represents compound Gaussian noise (also called 
Gaussian mixture), which can be written asW = \/AG, where A is a positive valued RV, which 

represents the scale of G, and G ~ A^(0, 1). Such a formulation is possible for symmetric alpha- 
stable noise, Middleton class-A noise, as well as other compound Gaussian noise distributions. 
The error rate conditioned on the effective channel X = x is given by 



which is a cm. function of x as can be verified by differentiating inside the expectation with 
respect to x. Using (6), this shows that when X <Lt Y then the average error rates satisfy 
Ex [-Pg (P^)] — [Pq (p^)]' s^cn for mixed (compound) Gaussian noise. Similar results can 
also be shown to hold for noise distributions such as the Laplace distribution which cannot be 
expressed as a compound Gaussian. 

2) Bounded Noise: Recall the system model from (27). If \W\ < C for some constant C, 
almost surely then Fw {x) — 1 for x > C and 1 — F^r [^/2x) = for > C. It is clear 
from Bernstein's theorem that a function, such as 1 — Fw (v^2a;) with bounded support cannot 
be cm.. From this, we can conclude that if the noise is bounded, it is possible for two SNR 
distributions to be LT ordered, although Ey ifiY)] need not be less than Ex [P^ {p^)] for all 
p > 0. This negative result emphasizes the effect of the noise distribution in claims of ordering 
and concludes our discussion of systems with non-Gaussian noise. 



We now corroborate our theoretical results using Monte-Carlo simulations. For ease of no- 
tation, we define (p) := Ex [P^ {p^)] to denote the average error rates of SISO systems 
operated in the effective channel X. Also, we use (p) :— Ex [Pf, {pg (X))] to represent the 
average error rates of systems involving multiple effective channel coefficients. 

One of the central results of Section III-B is that it is possible for one channel to be superior 
to another (in terms of error rates) at high SNR in the absence of coding, while being inferior 
when the capacity achieving code is used over both channels. This is illustrated in Fig. 3, which 
shows the comparative error rate performance of DPSK employed over an interference dominated 
fading channel with Pareto type distributed instantaneous SINR (having parameters = 2 and 

__Y 

= 5. Clearly, since P^ (p) < P^ (p) for p < —0.5 dB and vice-versa for p > —0.5 dB, 




(28) 



VI. Simulations 
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the system with effective channel X is not better than that with effective channel Y at every 
average SNR. On the other hand, Fig. 4, shows that the ergodic capacity of the system with 
instantaneous channel X is consistently larger than that when operated in the channel Y with 
parameter (5^ = 5. 

Figures 5, 6 and 7 show the performance of diversity combining schemes such as MRC, EGC 
and SC with L — ?, branches over two sets of i.i.d Ricean fading channels with parameters 
= 2 and = 5. Note that from Subsection IV-B, Xm <Lt l^n for m = 1, 2, 3. The trend 
observed in the performance analysis curves obtained herein can be equivalently obtained for 
any other sets of LT ordered effective channel random variables, using any modulation scheme 
whose error rate is a cm. function of the effective channel. 

In Fig. 5, we demonstrate that LT ordering of the instantaneous SNR distributions for the 
individual branches can be used to compare average error rates when MRC is performed at 
the receiver. For L — ?, receive diversity branches, it is observed that the error rate of BPSK 
in the channel with instantaneous SNR pY is consistently less than that in the channel with 
instantaneous SNR pX, which agrees with the fact that since the effective channel for Ricean 
fading is cm. in K, Xm <Lt Y^, for m = 1, 2, 3, and hence (p) < (p) for p > 0. 

Figure 6 illustrates that when X„i <Lt Ym, for rn = 1,2, 3, we get P^ (p) < P^ (p) for 
p > for the case of EGC employing BPSK. The error rate curves help demonstrate that fading 
channels with larger Ricean parameters offer smaller error rates than those with smaller Ricean 
parameters at all values of average SNR p when EGC is used, as predicted in Subsection V-B. 
Such a conclusion is not present in the literature due to the unavailability of a closed-form 
expression for the average error rate of coherent EGC in Ricean channels, which is applicable 
in all SNR regimes [1]. 

The comparative performance of SC using DPSK symbols is shown in Fig. 7. It is evident 
that although the individual branch SNRs are LT ordered, (p) > pf (p), for p < -0.4 dB, 
while Pg (p) < Pg (p), for p > —0.4 dB. This cross-over point is clearly depicted in Fig. 7 
using a linear scale for the error rate axis, since it is more easily discernible compared to the 
conventional log scale. Hence, selection combining of a better set of channels (in terms of error 
rates) need not yield a better system overall, at low SNR. 

The performance of a multi-hop amplify and forward relay is studied in Fig. 8. We assume 
the model described in Section V-D with M = 3 relays under two different Ricean fading 
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scenarios, one with parameter — 2 and the other with — 5. It is observed that the 
average symbol error rate of Y is consistently less than that of X at all SNRs. This, due to the 
fact that <Lt Y^, m = 1, 2, 3 (p) < pf (p) , Vp. 

Fig. 9 illustrates the comparative performance of an uncoded BPSK system over an additive 
compound Gaussian noise channel subject to two different Ricean fading effects modeled using 
parameters K-^ — 2 and — 5. We show that (p) < P^ (p) for all p > 0, when the noise 
follows a symmetric alpha-stable distribution with a characteristic exponent of 1.6. This shows 
that LT ordering results apply to systems with compound Gaussian noise, since an alpha-stable 
RV can be written as \^G, where G ~ CA/'(0, 1) and A is a positively skewed alpha-stable RV 
[23]. Such results are not found in literature, since a closed-form expression for the average error 
rate of BPSK under Ricean fading with symmetric alpha-stable noise is analytically intractable. 
In fact, even for the special case of K — 1 i.e. Rayleigh fading, a closed-form expression valid 
in the asymptotic high SNR regime is known [23]. 

In direct contrast to the results for the compound-Gaussian noise case, LT ordering of effective 
channels does not imply that the average error rate performance for noise with bounded support 
will satisfy the corresponding inequality at all SNR. In fact, as depicted in Fig. 10, where the 
unit-variance noise is assumed to be uniformly distributed on [— v^, \/3], it is observed that for 
p < 2.6 dB, Pg (p) < Pg (p), while the opposite holds for p > 2.6 dB. This corroborates the 
claim of Subsection V-I2, which states that LT ordering of effective channels does not imply that 

—'Y X 

the average error rates satisfy Pg (p) < Pg (p) for all p > 0, under noise with finite support. 

VII. Conclusions 

In this paper, we illustrate the power of stochastic orders such as the convex order and the LT 
order, which have never been used in physical layer communication/information theory, to relate 
and unify existing performance metrics such as ergodic capacity and error rate functions through 
their relationship with completely monotonic functions. We first identify that the instantaneous 
symbol error rate functions for various signaling constellations such as M-PSK and M-QAM 
are completely monotonic functions of the instantaneous SNR. Recognizing the importance of 
LT ordering of instantaneous SNR distributions, we identify parametric fading distributions such 
as Nakagami and Ricean, which are monotonic in the LoS parameters in the orders <Lt and <cx- 
We also lay the groundwork to find the conditions for the preservation of inequalities satisfied 
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by the averages of performance metrics of individual systems, when multiple such systems are 
combined, even when closed form expressions for such averages are not tractable. These include 
diversity combining schemes such as MRC, EGC and a variety of relay networks. 

In summary, this framework provides a novel approach to compare the performance of a 
vast range of systems on the basis of the analytical properties of the performance metric 
such as monotonicity, convexity, or complete monotonicity, even in settings where closed-form 
expressions are not tractable. 



Appendix 
Proof of Theorem 2 

For any m, viewing the m-hop channel as a series cascade of the first m — 1 hops and the 
m*'* hop, we have the following: 



(29) 



for m = 2, . . . , M. To prove the theorem, we will use induction. Clearly, Theorem 2 holds for 
m — 1. Taking expectation of both sides of (29), we have 



+ 



V -^l:m— 1 



(pX 



l;m— 1 , 



Ex. [Pe^ {pX. 



(30) 



We have E^,^_, 
esis, and Exi,„ 



P (pYi:^_i) by the induction hypoth- 

l:m — 1 ' 



(pXi., 



> E- 



follows because P„ 



is cm. and 



Xm <Lt Ym- The theorem then follows because the RHS of (30) is of the form Pi(l-P2)+^2(1- 
Pi), which is an increasing function of both Pi and P2, since <Pi < 1/2, 0<P2 < 1/2. 
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Fig. 1. Multi-branch multi-hop cooperative relay network. Rm,i ■ ■ .Rm,N^ represent the relays on the m link from the 
source S to the destination D. The corresponding instantaneous effective channel gains are denoted as Xjn,o ■ ■ ■ Xm,N^ ■ 




Fig. 2. Two hop AF cooperative relay network. 
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Fig. 3. Error probability comparison of DPSK modulation, under two different fading scenarios with LT ordered Pareto-type 
SNR distributions, using parameters = 2 and (3^ = 5. 
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Fig. 4. Ergodic capacity comparison of two different fading scenarios with LT ordered Pareto-type SNR distributions, using 
parameters =2 and = 5. C'^ [p) {C^ [p)) represents the ergodic capacity in the effective channel X (F). 
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Fig. 5. Error rate comparison of maximum ratio combining using L = 3 antennas with BPSK. (p) corresponds to the 
average symbol error rate under Ricean fading with parameter K-^ = 2 and (p) to the average symbol error rate under 
Ricean fading with parameter if ^ = 5. 
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Fig. 6. Error rate comparison of equal gain combining using L = 3 antennas with BPSK. (p) corresponds to the average 
error rate under Ricean fading with parameter = 2 and (p) to the average symbol error rate under Ricean fading with 
parameter = 5. 




p(dB) 

Fig. 7. Error rate comparison of selection combining using L = 3 antennas with DPSK. Pg (p) corresponds to the average 
symbol error rate under Ricean fading with parameter K''^ = 2 and (p) to the average S3Tnbol error rate under Ricean 
fading with parameter = 5. 
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Fig. 8. Error rate comparison of M = 3 liop amplify-forward relay with BPSK under Ricean fading. (p) corresponds to 
the average symbol error rate under Ricean fading with parameter K-^ = 2 and (p) to the average error rate under Ricean 
fading with parameter = 5. 




p(dB) 



Fig. 9. Performance comparison of BPSK in compound Gaussian noise (normalized symmetric alpha-stable distribution with 
characteristic exponent 1.6). (p) corresponds to the average symbol error rate under Ricean fading with parameter = 2 
and (p) corresponds to the average symbol error rate under Ricean fading with parameter = 5. 
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Fig. 10. Performance comparison of BPSK in noise with finite support (symmetric uniformly distributed noise with unit 
variance). (p) corresponds to the average symbol error rate under Ricean fading with parameter = 2 and (p) 
corresponds to the average error rate under Ricean fading with parameter A"^ = 5. 
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